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Abstract 

Following the work by Kitaev, Freedman and Wang [Q, Aharonov, Jones and Landau recently 
gave an explicit and efficient quantum algorithm for approximating the Jones polynomial of the plat 
closure of a braid, at the fcth root of unity, for constant k. The universality proof of Freedman, 
Larsen and Wang [5j implies that the problem which these algorithms solve is BQP-hard. The fact 
that this is the only non-trivial BQP-complete problem known today motivates a deep investigation 
of this topic. 

A natural question which was raised in [2J is the following. The results of actually gave efficient 
algorithms also in the case of asymptotically growing fc's - up to k which is polynomial in the size of 
the braid. However, the results of only imply universality in the case of constant k, via the Solovay- 
Kitaev theorem; The application of this theorem relies heavily on the fact that the generators of the 
groups in question are fixed. The question of the complexity of the problems with asymptotically 
growing k was thus left open. 

In this paper we resolve this question and prove that the Jones polynomial approximation prob- 
lem is BQP-complete also for asymptotically growing k (bounded by a polynomial). To do this we 
introduce some new techniques for analyzing universality in quantum computation, which enable us 
to apply Solovay-Kitaev indirectly. As a side benefit, we reprove the density theorem of [5j using 
quite elementary arguments; this hopefully sheds light on the reason that these problems are indeed 
quantum-hard. 
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1 Introduction 



What is the computational power of quantum computers? This question is fundamental both from a 
computer scientist as well as a physicist points of view. This paper attempts to improve our understanding 
of this question, by studying the only non-trivial BQP-completc problem known to us today: the problem 
of approximating the Jones polynomial. 

The Jones polynomial, discovered in 1985 0], is a very important knot invariant in topology. Its 
importance was manifested in connections to numerous areas in mathematics, from the statistical physics 
model known as the Potts model to the study of DNA folding. Among its many connections, an extremely 
important one was drawn by Witten in 1989 [5], to quantum mechanics; more precisely, to Topological 
Quantum Field Theory (TQFT). Witten showed how the Jones Polynomial and other topological invari- 
ants of links naturally appear in the Wilson lines of the SU(2) Chern-Simons Topological Quantum Field 
Theory. 

About a decade later, TQFT entered the scene of quantum computation, when Freedman suggested a 
computational model based on this theory [Hj. The works of Freedman, Kitaev, Larsen and Wang 0^11] 
showed an equivalence between the TQFT model and the standard model of quantum computation. 
On one hand, they gave an efficient simulation of TQFT by a quantum computer P^. On the other 
hand, they showed that quantum computation can simulate TQFT efficiently 0. These results draw 
interesting links between quantum computation and the Jones polynomial. The simulation of TQFT by 
quantum computers implicitly implies the existence of a quantum algorithm for the Jones polynomial 
approximation problem at the fifth root of unity e l27r / k : via the results of Witten; The simulation of 
quantum computers by TQFT implicitly implies the that the Jones polynomial approximation problem 
is BQP-hard. However, these important results were stated in TQFT language, and were not stated or 
proved explicitly in the above set of works. A clear statement of some of the algorithmic result, in a 
computational language, was given in but without an explicit algorithm. 

Only recently Aharonov, Jones and Landau [2j provided an explicit and efficient quantum algorithm 
for the problem of approximating the Jones polynomial at those roots of unity. The algorithm uses a 
combination of mathematical results of over 20 years ago due to Jones. In particular, the main ingredient 
of the algorithm is a unitary representation of the braid group called the path model representation, in 
which braids are mapped to operators acting on a Hilbert space, whose basis vectors are indexed by 
paths on a certain graph. This gave a simple to state algorithm for the problem, bypassing the TQFT 
language altogether. In the other direction, the universality proof due to can also be made explicit in 
the standard quantum model language, without referring to TQFT. To do this one needs to encode the 
space of n qubits into the space of paths; a simple way to do it, encoding one qubit in four steps of a 
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path, was independently discovered by by Kitaev UJ, and Wocjan and Yard jTU]. The latter also defined 
a universal set of gates that acts on these paths, and proved that certain additive approximations of the 
Jones polynomial are BQP-complete. 

The above two results together give an explicit proof that the problem of approximating the Jones 
polynomial at the fifth root of unity, and in fact, for any primitive root of unity e 2 ^ l ^ k , for constant 
k > 4, k 7^ 6, 10 is BQP-complete. This is the only non-trivial BQP-complete problem known today 1 . 
This fact highlights the importance of this problem in the context of the quantum computational power, 
and motivates deeper investigation of this topic. One natural direction to try to take is generalizations 
of the algorithms. Initial steps in this direction were made in |1 II HOj . Another interesting direction is 
to study the reasons for BQP-hardness, and its range of applicability. This is the path we take in this 
paper. 

One natural question is the following. It turns out that the algorithms given in the work of Aharonov 
et. al work not only for constant k, but also for asymptotically growing fc's. To be more precise, [3] 
gives an efficient quantum algorithm to approximate the Jones polynomial of an n strands braid with m 
crossings at a primitive root of unity e 27 "/ fe , where the running time of the algorithm is polynomial in 
to, n and k. The algorithm is therefore efficient even if k grows polynomially with n. On the other hand, 
we only know that the constant k case is BQP-hard. Therefore, in the following natural question was 
raised: what is the complexity of approximating the Jones polynomial for polynomially bounded fe? It 
was left open whether it is BQP-hard, doable in BPP, or maybe somewhere in between. 

In this paper we resolve this question, and show that it is also BQP-hard: 

Theorem 1.1 The problem of approximating the Jones polynomial of the plat closure of a given braid 
b, with to crossings, at e 27 "/ fc , where both m and k are polynomially bounded in n, (and k is some fixed 
function ofn), is BQP-complete. 

We outline the difficulties in the proof, and our methods to overcome them, below. We hope that this 
paper makes a significant step towards better understanding of the complexity of the Jones polynomial 
approximation problems, and of quantum computational complexity in general. In particular, we hope 
that the techniques we developed here would help in proving BQP-hardness of future quantum algorithms 
to follow. 

this paper we use the term BQP-complete loosely - in fact, this is a promise problem, and therefore is not complete 
in the usual sense. 
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1.1 Proof Outline 

Given an algorithm that calculates the Jones Polynomial of any link at e~ 2T "/ fc (for some integer k > 4 
and k ^ 6, 10) in polynomial time in the number of crossings in the link, and a classical Turing machine 
- we can simulate a Quantum computer efficiently. How is that possible? The key idea here, which is 
demonstrated in [3], is to use an intimate connection between two, seemingly distinct, worlds: links and 
unitary matrices. The connection is the so-called "path representation" which is a unitary representation 
of the braid group. For a fixed k, the fcth path model representation of the braid group maps every braid 
to a unitary operator on a Hilbert space spanned by paths on a certain graph which depends on k. In 
particular, each generator of the braid group (which is simply one crossings of two adjacent strands) is 
mapped to a certain (k— dependent) unitary operator on the space of paths. The BQP-hardness proof 
boils down to showing that a general quantum gate can be approximated efficiently using these unitary 
operators. 

Let us first review the easier constant k case. Here, the main difficulty in the proof is to show that 
the relevant set of operators is dense in some subgroup of the unitary group. Once this is shown, it is 
standard to apply the famous Solovay-Kitaev theorem to show that density implies efficiency. In other 
words, once the subgroup is dense, then Solovay-Kitaev give a method to approximate every gate in the 
quantum circuit by a short sequence of generators, and universality follows. 

But how does one prove the density? The starting point is the fact that Kitaev's four steps encoding 
encodes two-qubit gates into the unitary group operating on the space spanned by 14 paths. Density thus 
means that we can approximate any matrix in SUilA), using our fc-generators. The idea here is to first 
restrict attention to some two dimensional subspace, and show density in SU(2). This was essentially 
done by Jones |12| . We then gradually increase the dimensionality of the space on which we have density, 
to SU(1A), by adding one or more dimensions at a time; to this end we introduce two general and very 
useful lemmas: the bridge lemma l4~T1 and the decoupling lemma FOl This completes the density proof of 
the constant k case; we get an almost self-contained, fairly elementary proof. 

We would now like to move to the asymptotically growing k case. Here, however, there is a subtle 
point in the above line of arguments. Indeed, density still holds. But the step of density implies efficiency 
fails. The starting point of the Solovay-Kitaev theorem is the construction, using the set of generators, 
of an epsilon-net in the unitary group, where epsilon is some small enough constant. Such an epsilon-net 
is easy to construct, given a fixed set of generators that span a dense subgroup - essentially, brute force 
would do the trick. However, if k is asymptotically growing in n, the set of generators is no longer fixed. 
It is no longer clear that the very first step of the Solovay-Kitaev theorem, that of creating the epsilon-net, 
can be done efficiently. 
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We give here a very rough sketch of how we overcome this difficulty. We first choose a large enough 
but fixed kg. We now modify the fco-generators as follows: we leave their eigenvalues as they were, but 
change their basis to the basis of eigenvectors of the generators corresponding to the case k i— > oo. This 
change of basis turns out not to affect the fact that the set of generators generates a dense subgroup. 
Since the new set of generators is fixed (independent of n), we can find an epsilon-net which consists of 
these modified /co-generators simply by brute-force. The advantage of using the modified generators as 
the building blocks of our epsilon-net is that the eigenvectors of the modified generators are very close 
to the eigenvectors of generators corresponding to very large fc's. If we want to create an epsilon-net 
using fc-generators for very large fc's, we only need to approximate the modified generators, using these 
/c-generators. Since the eigenvectors more or less agree, we only need to approximate the eigenvalues, 
which can be done by taking the appropriate power of the relevant fc-generator. We get an efficient 
construction of an epsilon-net consisting of /c-generators. We can now apply the Solovay-Kitaev theorem 
using this net. 

We now proceed to the detailed proof, but before this, we first need to review some basic facts about 
the Braid group, and introduce the path model representation. 

2 Mathematical Background 
2.1 The braid group B n 

Loosely speaking, a braid is a set of n strands that connect two horizontal bars, such that each strand 
is tied exactly to one peg on the top bar and one peg on the bottom bar. When drawing the braid 
schematically on a paper, the strands may pass over and under each other, but at any point they must 
not be completely horizontal. Braids which can be deformed into each other without tearing any of the 
strands are considered identical. An illustration of a 4-strand braid is given in Fig. ^ 

The set of all braids with n strands forms an infinite and discrete group which is called the braid 
group B n . The product rule for 6162 is defined by placing the braid b\ above the braid bi and fusing the 
bottom of the b\ strands with the top of the &2 strands. The identity element is the braid with n straight 
lines that connect each peg at the bottom bar to its corresponding peg at the upper bar. 

In |13| . Artin proved that B n admits a finite presentation (the Artin presentation) with n— 1 generators 
{<7j} that satisfy the following constraints: 

GiUj — Oj(Ji for \i — j\ >2 , (1) 

CiUi+lCj = 0-i_j_l<7iOi + l . (2) 
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Figure 1: An example of a 4-strand braid 
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Figure 2: The generator ui in the braid group B4. 



Pictorially, <7 2 : is a braid that is identical to the unity braid in all strands except for the i and i + 1 strands 
which cross each other once (the i + 1 — > i strand goes over the i — > i + 1 strand) , connecting the lower 
i'th peg to the upper i + 1 peg and vice versa. An illustration of 02 is given in Fig. [21 It is an easy 
exercise to verify graphically that the braid generators indeed satisfy Q, ©■ 



2.2 The path representation 

The path representation is a unitary representation of the braid group B n that depends on an integer 
k > 3. The image of every b £ B n under this representation is denoted by p(b) and it acts on a finite 
Hilbert space. To understand the structure of this space, we introduce the graph Gk which is a set of fc — 1 
sites (vertices) and k — 2 edges that connect them. The sites are ordered from bottom to top one above 
the other, as described in Fig. To each site we assign a number according to its position; the number 
1 is assigned to the bottom site while the number fc — 1 is assigned to the top site. We then consider all 
possible n-steps walks (paths) over the graph Gk which start at site 1 and never leave Gk- With each 



Figure 3: The graph Gk 



path we associate a vector from the standard basis of B n by mapping the qubit f with a movement in the 
upper direction (i.e., moving from site i to site i + 1), and the qubit with a movement in the opposite 
direction. For example, the vector 1 101 1) represents the 4-steps walk 1^2^1^2^3. Of course, not 
all standard basis elements correspond to valid paths. For example, since all paths begin at the bottom 
site and we are not allowed to leave the graph, they must start with the bit 1. The subspace of all the 
legitimate n-steps paths is denoted by H n k. 

To define p, it is enough to specify its action on the braid group generators <7j, which we denote by 
Pi d = Pi a i)- We describe pi in terms of partial paths: if p is a bit string that corresponds to a path, then 
p\i denotes the first i — 1 bits of the string and p\ l denotes the last bits of the string starting from the 
i + 2 bit. For example, if p = "11100101", then p\ 3 = "11" and p\ 3 = "0101", hence p = p\ 3 10 p\ 3 . Next, 
we let Z{ denote the position on the graph after the path p\i was taken. So for p = "11100101", we have 
Z\ = 1, Z2 = 2, 23 = 3, 24 = 4, Z5 = 3 etc. Finally, we define 

9 = ir/k , (3) 
\ t d = sin(7r^/fc) = sm(£6) , (4) 
A d ^ ie-™' 2k = ie- 19 ' 2 . (5) 

Notice that definition (J2J is for I = 1, . . . , k — 1. For I < 1 or I > k — 1, we set Xi = 0. 

Using this notation we define the operators $^ with i = 1, . . . ,n — 1: for every initial/final strings pi, 



p\ we have 




(G) 
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|p|ilOp| l ) + 
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$i|p|illpf> 



(9) 



Then are given by 



def 

Pi = 



A$ ( + A" 1 ! = y4 _1 (A 2 $j + 1) = - e^ e $;) . 



(10) 



In |2j it is shown that that these operators indeed form a unitary representation of the braid group. 

Notice that the operators pi are invariant under the same subspaces of the $^ operators. Specifically, 
they multiply the vectors IpliOOpI 1 ), Iplillpl 1 ) by the phase A -1 , while mixing the vectors |p|il0p| l ), 
\p\i 01p\ l ). Therefore in the standard basis, pi breaks into one-dimensional and two-dimensional blocks 
(but notice that these are different blocks for different operators). The two-dimensional blocks mix the 
|p|il0p| l ), |p|i01p| 4 ) vectors whose corresponding paths end at the same point. It follows that if \p) 
corresponds to a path that ends at I, then pi \p) can be written as a linear combinations of vectors that 
correspond to paths that end at the site I. This is true for all pi, hence the path representation breaks 
into representations over subspaces that correspond to paths that end at a particular I. We denote these 
subspaces by H n ,k,t- 

2.3 From braids to links to Jones Polynomial 

As stated in the beginning of this section, the path representation provides a connection between the 
Jones polynomial of links to unitary matrices. We will now explore this connection in some depth. In 
particular, we will see that quantum mechanical expressions of the form (0|C/|0), which are sufficient to 
describe quantum computation, can be given in terms of the Jones polynomial. 

We first notice that a braid can be transformed into a link by connecting its open endpoints. Such an 
operation is called a closure, and here we focus on one particular closure: the plat closure. This closure 
is defined only for braids with an even number of strands. It is the link that is formed by connecting the 
top pegs with odd numbers with the peg to their right, and doing the same with the bottom pegs. The 
plat closure of a braid b € B n is denoted by b pl . Figure 0] shows the plat closure of the 4-strand braid 
from Fig. ^ It turns out that there is a very strong connection between the path representation of a 
braid and the Jones polynomial of its plat closure. 
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Figure 4: The plat closure of the 4-strand braid from Fig. ^ 



In it was shown that the Jones Polynomial of the plat closure of every b £ B n can be given by a 
"sandwich" product of the operator p(b) with a special vector \a) G Specifically, let V bP i(-) denote 

the Jones polynomial of b pl , and \a) be the "zig-zag" vector 

| a) = |10101 ... 10) , (11) 

that corresponds to a path in which the first step is up, the second is down, the third is up etc. Then 
according to [3], the following equality holds: 

(a\p(b)\a) = ^d-^i-Ar^^VMA- 4 ) ■ (12) 

w(b pl ) is the writhe of the link b pl , and V bP i {A~ A ) is the Jones polynomial of b pl , evaluated at A~ A = 
e -2iri/k^ rp ne exac |- definition of these two quantities is given in The other quantities in the 

formula are Xi and A which are defined in Eqs. Q |SJ) respectively, and N and d which are defined by 

fc-i 

N = ^A,dim(ff„, M ), (13) 

d d ^ f -A 2 - A- 2 . (14) 

Note that the writhe is a trivial function of a link, as it is basically a sum over all its crossings. The 
Jones Polynomial, on the other hand, is a very non-trivial function of the link; the best known classical 
algorithms for its calculation are exponential in the number of crossings. 

Notice also that these two quantities are only defined for oriented links, and therefore we must choose 
some orientation for W 1 to calculate them. It does not matter, however, which orientation we pick since 
the combination (— A)~ 3wi - bFl " > V bP i {A~ A ) is independent of the orientation (in agreement with the LHS of 



the equation). It is actually the Kauffman bracket (b pl )(A) of the link b pL - which is also a polynomial of 
the link. For further details we refer the reader to |14j . 

3 Simulating a quantum computer 

With the path representation at hand, together with Eq. (|12|) . we can devise a plan to simulate a quantum 
computer. The idea is straightforward: given a description of a quantum circuit U over n qubits, which 
is a poly(n) sequence of unitary gates from a universal basis, we find a braid b such that 

(a\p(b)\a) ~ (0|[/|0) . (15) 

Here the ~ sign means that we can approximate the RHS by the LHS to any desired accuracy. Thus 
if we have an oracle that, given b, tells us the Jones polynomial V bP i(e~~ 27:i / k ), then from Eq. I|12|) we 
can easily find the LHS of Eq. (|15JI . which approximates (0|t/|0). This is sufficient to simulate quantum 
computation because of the following reasoning. Suppose we want to know the probability that a given 
quantum circuit Q outputs in its last qubit when applied on an input string X and then measured. We 
replace Q by a circuit U with one more qubit, which starts by applying NOT gates on the qubits that 
are 1 in the input string X, then applies Q, then copies the last qubit of Q to the additional qubit by a 
CNOT gate, and then applies Q in reverse, and also reverses the initialization of the input qubits to get 
0. It is a simple algebra exercise to show that (0|f |0) is equal to the probability to get in the original 
circuit Q applied to X. 

The algorithm for finding the braid b, must be polynomial in n (the number of qubits in U) if we want 
our simulation to polynomial. Additionally, if we assume that we can calculate the Jones polynomial in 
a time which is polynomial in the number of its crossings, then we must also assure that the number of 
crossings in b is polynomial in n, in order to keep the overall simulation polynomial. 

3.1 The 4-steps encoding 

We would now like to find a braid b that corresponds to the operator U as in Eq. Ijl5(l . We first simplify 
the matrices we deal with. It is easy to verify that any quantum circuit can be translated, with a linear 
overhead, to an equivalent quantum circuit which acts on qubits arranged in a one-dimensional array, 
and applies gates only on nearest neighbor qubits. We therefore assume that U is given by a product 
of poly(n) basis elements U = Ul • ■ ■ ■ • U\, (L — poly(n)) each of them acting non-trivially only on two 
adjacent qubits. 

The next step is to encode U in path space as it is the space on which the operators p(b) act. One 
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such convenient encoding represents every qubit by a 4-steps path: 

|0) d = |1010) (16) 
|1) = f |1100) . (17) 

Then a string of n encoded qubits is encoded as a 4n-steps path in H^ n} k, and every gate U is encoded 
as a 2 4 " x 2 4 ™ matrix U: 

e = ^^|i)(2l+love rt e StafS p S ce • (18) 

hj 

Here |i) denotes the 4n-steps path that encodes the qubits in the original vector Notice, however, 
that this is not an arbitrary path in H^ n> k as it returns to the initial site every 4 steps. We denote the 
subspace that is spanned by all these paths by S. 

Next, the product U = Ul ■ . . . - U\ naturally translates to U = U L ■ . . . ■ U_ l and so by finding braids 
hi e Bi n such that p(bi) ~ U_ i and then taking their product b = bjj ■ . . . ■ b\, we will get p{b) ~ U_. 
Consequently we would have: 

(0®»|[/|0®") = (0®™|C/|0® n ) ~ (0®"b(&)|0®™) = (a\p(b)\a) . (19) 

In fact, we will not be so ambitious; we will only require that p(bi) ~ U_ i on the subspace S, and show 
that this suffices. 

The advantage of using this particular encoding is that, together with the tensorial structure of the 
qubits, it allows us to concentrate on the "reduced" braid group Bg instead of the larger group B± n . 
Let us explain exactly what is meant by that. Suppose that we wish to perform an operation on the 
s, s + 1 encoded qubits of some path \p) S S. Then we must use a braid b G B^ n that mixes the 8 strands 
4(s — 1) + 1 — > 4(s + 1) while being trivial on the rest. However, since \p) £ S, its path reaches the first 
site before the 4(s — 1) + 1 and 4(s + 1) + 1 steps. Therefore the three partial paths that are defined by 
the steps 1 — > 4(s — 1), 4(s — 1) + 1 — » 4(s + 1) and 4(s + 1) + 1 — > 4n are all legitimate paths over the 
graph Gk (i.e., they start and end at the first site and never leave G&). We denote these partial paths 
by Po,P and pi respectively, and write \p) = \po) <S> \p) <S> \pi)- Notice also that \p) g Hg,k,i- We will add 
a tilde to all vectors and operators that act on that space. In particular, we define b £ B$ to be the 
"reduced" version of b, created by the 8 non-trivial strands of b. 

It is now easy to verify that 

p(b)\p) = p(b) (bo) O \P) ® |pi>) - bo) <8> (p(6)b)) ® bi) . (20) 

This follows from the definition of the generators pi in Eqs. ()6I9I 110(1 which only depend on c, - the 
position of the path after i — 1 steps, and not on the index i itself. 
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By linearity, we can extend Eq. 1121 III to all vectors in S, which are simply superpositions of encoded 
paths. Therefore, as long as \p) £ S, it is enough to search for an appropriate braid in the much simpler 
group, B%, instead of looking in the full -64™ group. What remains to show is that (i) we can approximate 
any operator on -ffs.fc,i using a b £ B% (and that this can be done efficiently), and (ii) that the state we 
work with is always sufficiently close to the subspace S where Eq. 1|2U[1 is valid. The next theorem and 
its subsequent claim show exactly that. 

Theorem 3.1 (Density and efficiency in Bg) Let U_ be an encoded two-qubit quantum gate, and let 
5 > 0. Then there exists a braid b £ Bg, consisting of poly (fc, 1/5) generators of B%, such that 

\\(p(b)-U)\P)\\<5 , 

for every \p) £ i?8.fc,i; provided that k > A, k 7^ 6, 10. Moreover, b can be found in poly(k, 1/(5) time. 

We will devote most of the remainder of the paper to proving this theorem. Before we start, however, let 
us see how, together with Eq. (|2(J[1 , it can be used to construct the appropriate braid b £ B^ n in Eq. I|19|) . 

Let U = Ul ■ . . . ■ Ui with Ui being local two-qubit gates, and let e > be an arbitrarily constant. 
For every Ui we use the theorem to construct a braid bi £ Bg, with 5 = e/ L, and extend it into a braid 
bi £ B n by adding identity strands at the appropriate places. Finally, b is taken to be the product of 
these bi's. We have 

Claim 3.1 \\U L -U L _ 1 ---U 1 \a)-p(b L )-p(b L . 1 )---p(b 1 )\a)\\ < e. 
Proof: The claim is easily proved by induction. Indeed, assume that 

\\U i _ 1 ---U 1 \a)-p(b i . 1 )---p(b 1 )\a)\\< t -jle, (21) 

and define \(3) = f U_i-\ 1 ' 'IZil^)- It is eas Y to verify that any encoded gate U_ sends the subspace S into 
itself and therefore \f3) £ S. Consequently 

\\U_M -p{bi)\0)\\ - \\UM - P (k)\f3)\\ < j-e , (22) 

where the first equality follows from the reduction in Eq. (|20Jl and the second inequality follows from the 
way in which we constructed bi £ B§. Then using the induction assumption together with the triangle 
inequality, we get 

||C£ i ---2 1 |a)-p(6 i )-"p(6 1 )|a)|| (23) 

= Um - P(bi)\(3) + P&m^ ■■■U 1 \a)- p(bi) ■ ■■p(h)\a)\\ 

< liai/?) - P(h)\(3)\\ + ■■■U 1 \a)~ P(bi-i) • • • p(h)\a)\\ 

e . e e 

* L +(l - 1 h = l L- 
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□ 

This shows that the braid b = OL---b\ satisfies (a\p(b)\a) ~ (a|[/|a) as required by Eq. (|15|l and 
Eq. HI 9(1 ■ By theorem 13.11 the number of braid generators that are needed to approximate every gate is 
of the order poly(k, 1/(5), and they can be found in time poly(k, 1/6). Since we have L = poly(n) gates 
then the total number of generators is poly(n, k, 1/8). Finally, as we assume that k = poly(n) then the 
total number of generators is poly(n, 1/8). This is exactly the number of crossings in the resultant link 
since every generator contains exactly one crossing. We have thus proved Theorem ll.il which states that 
approximating the Jones polynomial of a plat closure of a braid with m crossings in e 2 W fc ; with both m 
and k polynomially bounded in n - is BQP-complete. 

In the next section we will prove theorem 13. II - the B$ density and efficiency theorem. We first show 
that the images of B$ under the path representation form a dense subset of SU(Hs,k,i)- This, together 
with the Solovey-Kitaev theorem would prove theorem 13.11 for constant k. We complete the proof in 
Sec. 14.31 where we extend the theorem to the general, k = poly{n), case. 

4 Proving the Bg density and efficiency theorem 

To prove theorem 13 . II let us first analyze the structure of the subspace Hg t k,i and the generators p%, . . . ,p-j 
of the Bg path representation that act on it. 

4.1 The structure of the generators in H 8jki x 

In Sec.lOwe saw that the generators break into 2-dimensional and 1-dimensional blocks when represented 
in the standard basis. Let us look at these blocks in some more detail. 

Consider first the action of pi on vectors with zi = 1: here \ Zi _i = and consequently \p\i 10p\ l ) is 
an eigenvector of $i : 

$i\p\i 10pP) = ^-\p\i 10p\ l ) = 2co8e\p\i 10 p\*) . (24) 
It is therefore an eigenvector of pi with eigenvalue 

A- 1 (l-2e-* e cos9) = -A- 1 e- l2t> . (25) 

The vectors \p\i 01p\ l ), \p\i 00p\ l ) do not exist since they represent a path that leaves the graph. Finally 
the vector \p\i 11 is nullified by <i>i and is therefore also an eigenvector of pi with the eigenvalue A^ 1 . 
Next, consider the z, > 1 case. Here $i nullifies the |p|j00p| 8 ), |p|illp| l ) vectors while mixing 
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\p\i 10p\ l ), \p\i 01p\ l ). Its matrix in that block is given by 




-i 



) 



A z . 



(26) 



It has two eigenvalues: and 2 cos 6*, and consequently the eigenvalues of pi in that block are A -1 and 
— A~ 1 e~ lW - the same eigenvalues of the Zi = 1 case. In fact, it is not hard to see that all the pi operators 
are equivalent to each other. 

The 2x2 matrix that transforms the standard basis elements {\p\i 10, p\ l ), \p\i 01p| 4 )} to the eigen- 
vectors basis is 



The other vectors {\p\i 00p\ l ), \p\i llp\ 1 )}, if exist, are trivial eigenvectors of pi with eigenvalue A -1 . 

Finally, we note that for k > 5, i?s,fc,i consists of exactly 14 paths 2 and hence it is a 14 dimensional 
space. These paths are shown graphically in Fig. [5] We identify each of them with a basis vector and 
label them by the numbers 1, . . . , 14. Using this labeling we write down the block structure of the seven 
generators pi,...,pf in Tabled For each operator, the table lists the non-trivial blocks (i.e., blocks 
where $i does not vanish). These are specified by brackets that surround the vectors that define them. 
The one-dimensional blocks correspond to the Zi = 1 case while the two-dimensional blocks correspond 
to the zi > 1 case. 

4.2 Proving the density 

We will now prove the density part of Theorem 13.11 We will show that the seven operators pi can 
approximate any special unitary matrix on iT8,fe,i> provided that k > 4 and k ^ 6,10. As it is a 14- 
dimensional space, we are interested in matrices U S S'J7(14) 3 . 

We begin by considering the action of p\ and pi on this subspace. From Table ^ we see that these 
operators act non-trivially on the five 2x2 blocks {1, 2}, {3, 4}, {5, 6}, {7, 8}, {9, 12}, while applying the 

2 For k = 5 there are actually only 13 paths, as path 14 is illegal (it gets out of the graph). Nevertheless, it is easy to see 

that the density proof still holds in that border case. For k = 4 we cannot prove density (see theorem 14.11 while for k < 4 

the H% fc i is too small to encode 2 qubits. 

3 For k = 5 we look at SU(13) and ignore the vector 14 




(27) 



Inside that subspace we have 




(28) 
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Figure 5: The 14 different vectors that correspond to paths on 8-strands, starting at 1 and ending at 1 



Pi ■■ 


(1) 


(3) 


(5) 


(7) 


(9) 


P2 ■ 


(1.2) 


(3,4) 


(5,6) 


(7,8) 


(9,12) 


P3 ■ 


(1) 


(3) 


(6,10) 


(8,H) 


(12,13) 


Pi ■ 


(1,5) 


(2,6) 


(3,7) 


(4,8) 


(13,14) 


P5 ■ 


(1) 


(2) 


(7,9) 


(8,12) 


(11,13) 


P6 ■ 


(1,3) 


(2,4) 


(5,7) 


(6,8) 


(10,11) 


Pi ■ 


(1) 


(2) 


(5) 


(6) 


(10) 



Table 1: The block structure of the generators of B 8 in H 8 ,k,i for k > 5. 
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trivial A -1 phase on the rest. In these blocks, the p\ operator is represented by (i), whereas the p 2 
operator is represented by (i, j). Additionally the action of the operators on all five blocks is equivalent. 
The following theorem assures us that in each such block we may approximate any SU (2) matrix. 

Theorem 4.1 (Jones |12j) If k > 4, and k ^ 6, 10, then in each 2x2 block, the group that is generated 
by p\ and pi is dense in SU(2). 

Proof: Given in appendix lAl 

Next, consider what happens when we are also allowed to act with p%. Looking at Tabled we see 
that the resulting operators are diagonal on the blocks {1,2}, {3,4}, {5,6,10}, {7,8,11}, {9,12,13}. 
Obviously, we can still approximate any SU(2) matrix in the 2x2 blocks. The next lemma provides 
a way to increase the dimensionality of the space on which we have density, in the following way: We 
start with SU(A) and SU(B), for two orthogonal subspaces A, B. If we also have a bridge, namely, some 
operator which mixes the two subspaces, then we have density in SU (A(B B). This general lemma is very 
reminiscent of a lemma which appeared in an early version of |15j . Its proof uses a combination of ideas 
by Aharonov and Ben-Or |15| and from Kitaev |16| . 

Lemma 4.1 (The Bridge Lemma) Consider a linear space C which is a direct sum of two subspaces 
A and B, and assume that dim_B > dim A > 1. Then any U £ SU{C) can be approximated to an 
arbitrary precision using a finite sequence of transformations from SU{A), SU(B) and any transformation 
W £ SU(C) that mixes the two subspaces. Consequently, the group generated by SU(A), SU(B) and W 
is dense in SU(C). 

Proof: Given in appendix lAl 

The bridge lemma implies that it is also possible to approximate any SU (3) matrix in the 3x3 blocks. 
As an example, consider the {5,6,10} block. From Theorem 14.11 we already know that we are able to 
approximate any SU(2) transformation on the {5, 6} block, and now we have the transformation p% that 
mixes this subspace with the one-dimensional subspace of that is spanned by tenth vector. Lemma |4.1I 
therefore guarantees that together they can approximate every transformation in SU(3). 

In the above reasoning there are two small cavities that are worth mentioning, since they will appear 
in the rest of the proof. Firstly, the mixing transformation p^ does not belong to SU(3). This, however, 

def 

is not a real problem, as we can always consider the transformation p^ — cp^ with c some phase such 
fixes p 3 in SU(3). Then {p l ,p 2 ,p i ) is dense in SU(3), and since [SU (N) , SU (N)} = SU{N) then also 
[(pi,p2,P3),(pi,P2,Pa)] is dense in SU(3). But the last group is equal to [(pi,p2,Pa), {px,p2,Pi)\ since 
the group bracket cancels out the phase c and therefore also {pi,p2,P3) is dense in SU(3). 
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Secondly, we know we can approximate any transformation in SU(2) while Lemma 14 . 1 1 assumes that 
we can get any transformation in SU(2) precisely. But since the approximation is made of a finite product 
of operators, all of which can be approximated as accurately as desired by p\,p2,ps, it follows that we 
can also approximate any transformation in SU(3) to any desired accuracy. 

Naturally, the next step is to consider what happens when we are also allowed to act with p^. 
From Table^ we see that resulting transformations will be invariant under the subspaces {1,2,5,6, 10}, 
{3,4,7,8,11}, {9,12,13,14}, that together make up the entire 14-dimensional subspace. We can use 
Lemma |4~T1 again to learn that we can approximate any SU(A) transformation in the {9, 12, 13, 14} block. 
But what about the two other, five-dimensional blocks? There we cannot use Lemma 14 . 1 1 directly. To un- 
derstand why this is so, consider, for example, the subspace {1, 2, 5, 6, 10}. We know that using p\, p2, P3 
we can approximate any SU(2) transformation on the {1, 2} block and any ST/(3) transformation on the 
{5, 6, 10} block. We also know that that p4 mixes these two blocks. However, to use Lemma |4~T1 we must 
be able to approximate the SU (2) transformations independently of the SU (3) transformation. In other 
words, we must be able to approximate an SU(2) transformation on the subspace {1,2}, while leaving 
the subspace {5, 6, 10} invariant and vice versa. But this is not a priori true since the transformations on 
{1,2} are generated by some sequence of the operators pi,p2,P3, which simultaneously generates some 
transformation on {5, 6, 10}. Luckily, we can use the fact that the dimensionality of the two subspaces is 
different in order to prove that such decoupling is possible: 

Lemma 4.2 (The Decoupling Lemma) Let G be an infinite discrete group, and let A, B be two finite 
Linear spaces with different dimensionality. Let T a and Tf, be two homomorphisms of G into SU(A) and 
SU(B) respectively and assume that T a (G) is dense in SU(A) and Tb(G) is dense in SU{B). Then for 
any U £ SU(A) there exist a series {cr n } in G such that 

Ta{<Jn) - U (29) 

n{a n ) -» 1 , (30) 

and vice versa. 

Proof: Given in appendix 1X1 

It is therefore clear that we are able to approximate any SU(5) transformation on the {1, 2, 5, 6, 10} 
and {3, 4, 6, 7, 8, 11} blocks. Using p$ we can now mix the {3, 4, 7, 8, 11} subspace with the {9, 12, 13, 14} 
subspace, and using the fact that their dimensionality is different, together with Lemmas 14 . 1 1 14 . 2l - we are 
guaranteed that we can approximate any SU (9) transformation on the combined 9-dimensional subspace. 

Finally, by using p^, we mix the five-dimensional block {1, 2, 5, 6, 10} with the nine-dimensional block 
from above - thereby approximating any transformation in SU(1A). This completes the density proof. 
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4.3 Proving the efficiency for the k = const and k = poly(n) cases 

Having proved that the operators pi, . . . , pi can approximate any matrix U € SU (14), the efficiency of the 
process for a constant k follows easily. We want to be able to approximate any U S SU(1A) up to some 
error 5 using poly (1/5) operators in poly (1/5) steps. This, and actually much more, can be achieved using 
the Solovay-Kitaev algorithm ^Bj. The algorithm relies on the fact that pi,...,pj are dense in SU(14), 
and provides us with a ^-approximation to U that consists of no more than poly(\og(5~ 1 )) operators in 
poly(\og(5^ 1 j) steps! 

One should be aware, however, that the Solovay-Kitaev algorithm contains an initial step, where an 
e-net is constructed. This is a finite set of operators that is generated by px, . . . , p? and has the property 
that every operator in S'[/(14) is closer than e to at least one of the elements of the net. e is a finite 
constant which is unrelated to the target accuracy 5, and whose actual value is of the order 10~ 2 (see, 
for example, Ref \17\). The existence of such a net is guaranteed since we know that pi, . . . , pi generate 
a dense set in SU(1A)\ its construction takes a constant time. We note that this constant may be very 
large since we might need to use brute force. Fortunately, the same e-net can be used in all calculations, 
and we can treat its construction as a pre-computational stage. 

The situation becomes more tricky when we let fc be poly normally dependent on n. In such case k, 
and consequently the operators pi,...,pj are no-longer constant, and a new e-net has to be reconstructed 
for every n. Then we can no longer treat the e-net construction as a constant step, and its complexity 
must be taken into account. The question is therefore whether we can still guarantee that the overall 
computational cost is polynomially bounded in k and in log(^ -1 )? The answer is yes. The idea is to 
construct an e-net for some fco and then use it to efficiently generate e-nets for any large enough fc. 

We begin by considering the k — > oo limit of the pi operators. From Sec. 14.11 we recall that in the 
standard basis these operators decompose into 1 x 1 or 2 x 2 blocks. The diagonalizing matrix of the 
2x2 blocks is Vk(z), given by Eq. (|27|l . Here, and in what follows, we explicitly added the subscript k 
to V(z) to indicate its dependence on k. In the limit k — > oo, we have 9 — » and we get 



We now pick a finite fco - say, ko = 7 - and construct a new set of generators {pi,p2, ■ ■ ■ , P7} by going 
through the 2x2 blocks and replacing each finite V(zi) by Voo(zi) while keeping the same eigenvalues. It 
is easy to see that also this set of operators generates a dense set in SU(14). Indeed, the density proof in 
Sec. 14.21 remains valid since it only relies on the eigenvalues of the generating operators and on the fact 
that for z > 1, V(z) mixes the two standard basis vectors. We will thus generate an e-net from {pi} and 
use it to generate the e-net of {p{\ for large fc's. The idea is simple: for large enough fc, Vk(z) is close 




(31) 
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to Voo(z), and at the same time the eigenvalues of the {pt} are roughly e 27ri / fc °, whereas the eigenvalues 
of {pi} are roughly e 2 W fc . Therefore we can tentatively approximate every pi by p\ k ^ , and obtain the 
desired e-net. The exact formulation of this idea is contained in the following Lemma: 

Lemma 4.3 Let E be an e/2-net, generated from {pi}, and assume without loss of generality that each 
element in E is a group commutator (this is possible since SU(14) is a simple Lie-group and therefore 
[SU (14), SU (14:)} = SU(14)). Then for large enough k, by replacing every occurrence of pi in E by p\ m , 
with m = 0(1/ k), we obtain a new net, Ek, which is an e-net. 

Proof: 

Let d be the maximal number of generators that are needed to construct an element in E. We 
wish to be able to approximate any pi up to at least e/2d using pi. The first thing we take care of is 
that Vk(z) will be close enough to Voo(z). We therefore pick an integer K\ such that for any k > K\, 
\\V(z)-V oc (z)\\<e/(6d). 

Next, we must find a Ki such that for any k > Ki, the eigenvalues of pf m will be close enough to pi, 
for some yet to be determined m. This is more conveniently done by defining 

Pi = A(k )pi , Qi [A(k)p t ] 2 , (32) 

and approximating the operators Pi with Qi. In the end, the factors A(ko), A(k) will cancel out when we 
plug these operators to the group commutator of each element in E. The logic behind these definitions 
is that these factors cause one of the eigenvalues of both Pi and Qi to be exactly one (see Eq. l)l()fl L and 
therefore we only have to match the remaining eigenvalues. Indeed, the non-trivial eigenvalue of Pi is 
e -i7r(2+fe )/fco ; whereas the non-trivial eigenvalue of Qi is e _47r2 / fc . We therefore define 

(2 + k )/k Q 



711 



(33) 



e — i-n(2+k )/k _ ^-Amni/k 



4/fc 

and let Ki be such that for every k > K2 

' < e/(6d) . (34) 

It is easy to see that it is enough to take K2 > ko for which |e~ 47 ™/ X2 — 1| < e/(6d). 

Assume then that k > max(ifi, K2) and let us estimate the distance between Pi and Qf 1 . This is 
the maximal distance between the corresponding blocks in the standard basis. In the lxl blocks both 
operators have an eigenvalue 1 and therefore the distance is zero. In the 2x2 blocks we have 

e ~iir(2+k )/k Q 

1 

g— 4m7ri/fc q 

1 





(35) 
(36) 
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and consequently 



g — i7r(2+fc )/fco _ e -4m7ri/fc 



+ \\V oc {z)-V{z)\\<e/{2d) . (37) 

therefore ||P-Q m || < e/(2d). 

Let us now return to the P-net and create the Ek net. Any element in E is a commutator of products 
of f>i, and therefore remains unchanged if we replace pi — > Pj, because the phase factors cancel out in 
the commutator. The distance of this product from a product in which we replace p — > Q™ 1 is smaller 
than e/2 since ||Pj — Q™|| < e/(2d) and we have at most d terms in the product. The Q™'s product is 
unchanged upon the replacement Q™ — » {pi) 2m (again, the phase factors cancel out), thereby creating 
the Ek net. Hence any element in E can be approximated up to a distance e/2 by an element of Ek- It 
follows that Ek is an e-net. □ 

It follows that we can create an e-net from the operators pi, and because m < k, the number of steps 
that are needed to create this net is bounded by poly(k). The next step would be the application of the 
Solovay-Kitaev algorithm to approximate any transformation U £ 577(14) up to an error 6 - and so the 
overall computational cost is bounded by poly{k, 1/5) as required. 



A Proofs 

A.l Proof of Theorem 14.11 

Proof: 

Since pi and P2 are not in SU(2), we will look at their images under the canonical homomorphism 
U(2) — ► 577(2) which takes V £ U(2) to (det V)~ 1 ' 2 V, and prove that these images form a dense set in 
577(2). Then using the fact that [577(2), SU(2)} = 577(2) it will follow that also p x and p 2 generate a 
dense set in 577(2). 

We first use the fact that group which is generated by pi, p 2 is infinite as long as k > 2 and k ^ 4, 6, 10. 
This results was proved by Jones 4 in 1983 and appears in Theorem 5.1 page 262 in ref ^2). It uses 
the canonical homomorphism between 577(2) and 5*0(3) and the well known classification of all finite 
subgroups in 50(3). 

To approximate any element in 577(2) to within an e, we pick two matrices in g\,gi £ G such that 
||.9i — .^72 1 1 < e/3 (we can do that since G has an infinite number of elements and 577(2) is compact), and 

4 The theorem also shows that for k = 10, p{B n ) is infinite provided that n > 3. This leads us to believe that the Bg 
density and efficiency theorem is also valid for k = 10 - however this does not follow from our proof. 
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we define g = gig^ 1 ■ Then obviously \\g — 1|| < e/3. Consequently, if e ±lA are the eigenvalues of g then 
| e ±«A _ i| < f 73^ Also g must be non-commuting with at least one of the matrices p\ or p2 which we shall 
denote by T. 

Let U be the diagonalizing matrix of g: g = C/~ 1 diag{e lA , e~ lX }U, and define the two continuous 
families of matrices 

R(<f>) ^ CZ-Miagje^, e'^U , (38) 
S(</>) d = a- x R{cj))a . (39) 

Then it is easy to see that any matrix V G SU(2) can be presented as the product R(a) S (f3) R(j) for a 
suitable choice of a, j3 1 7 € R (see, for example, Kitaev JS]). But since |e lA — 1| < e/3 then any member 
in the families R(-), S(-) can be approximated by multiplications of g and a up to a distance of e/3, and 
therefore the multiple R(a) S \[3) R("f) can be approximated to within e. □ 

A. 2 Proof of Lemma 14.11 (The Bridge Lemma) 

To prove lemma PPl we first need to prove the following two lemmas: 

Lemma A.l Consider a linear space C which is a direct sum of two subspaces A and B such that 
dim_B > dim A > I, and let W € SU(C) be some transformation that mixes the two subspaces. Then 
for every pair of normalized vectors |^) G C , we can approximate a transformation T^,^^ G SU(C) 
such that T^-f^ip) = \4>) , to any desired accuracy using a finite product of transformations from SU{A), 
SU(B) and W. 

Proof: 

Instead of approximating the transformation T,/,—^ for any two vectors \ip), |</)), we will approximate 
a transformation that transforms a particular vector \v*) to an arbitrary vector \ip). Then obviously, 

Ti rri rri — 1 

4>^<t> - 1 'P 1 if, ■ 

We begin by finding a vector \v*) € B for which VFlw*) € B. Such vector must exist since dimi? > 
dim A. Indeed, let . . . , \v n ) be a basis of B. Then W\v t ) = a^) + (5^) , with \u'^ S A and S B. 
Then since dimi? > dim A, the \u'j) vectors are linearly dependent and we can find a non-trivial linear 
combination such that X^^^l^i 

) = 0. Then the vector \v*) = £ Ci\Vi) is in B and W^v*) has no 

projection on A. 

Next, we pick a vector \u*) E A for which IT^u*) has some projection on B. Such vector must exist 
because if WA C A then by unitarity WA = A. This implies WB = B, again by unitarity - contradicting 
the assumption that W mixes the two subspaces. 
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Most generally, W\u*) = a\u) + b\v) with \u) e A, \v) e B and \a\ 2 + \b\ 2 = 1. If a ^ 0, we find 
a transformation U G SU(A) that takes |it) to and define W = UW, otherwise, we set W — W. 
We have thus constructed a transformation W for which T^l?;*) £ B and W^w*) = a\u*) + b\v) for some 
< \a\ < 1. 

Now let = a\uo) + (3\vq) be an arbitrary vector, with |wo) £ A and |«o) € -B. We will now 
apply a series of unitary operations that will take closer and closer to \v*}. We start by moving \uq) 
to \u*) and \vq) to \v*) using transformations from SU{A) and SU(B) respectively, yielding the vector 
= a\u*) + P\v*). Using W we obtain 

\tp[) = W\tpi) = aa\u*) + ab\v) + W\v*). (40) 

As € B, we can now apply a transformation from SU(B) that takes ab\v) + W\v*) to C2\v*). 

Here C2 is the norm of ab\v) + W^lv*), which is usually smaller than one. We obtain 

|V> 2 } =aa\u*) +c 2 \v*). (41) 

Notice that comparing \ip2) to w e see that we managed to move some of the weight from \u*) to 

\v*}. We iterate this process. We get \tp' 2 ) by applying the Wi transformation on \ip2)> an d obtain ^3) 
by moving the B part of \ip' 2 ) to \v*). After n such iterations we obtain 

\^ n ) = aa n - l \u*) + c n \v*), (42) 

and since \a\ < 1 it is obvious that we exponentially converge to \v*), and in particular we can approximate 
T^j, (and hence T^,^^) to any desired accuracy using a finite number of transformations. □ 

To continue, we need to be able to move a vector from subspace A to subspace B without affecting 
the rest of the vectors in subspace A. The following Lemma guarantees that this is possible. 

Lemma A. 2 Under the same conditions of Lemma \A.l\ let {|iti), . . • , \u n )} be an orthonormal basis of 
A and {\v\) 1 . . . , |fm)} be an orthonormal basis of B. Then using a finite product of transformations from 
SU(A), SU(B) and W , it is possible to approximate to any accuracy a transformation T that moves \u\) 
to \v\) , while leaving the vectors \u 2 ), ■ ■ ■ , \u n ) unchanged. 

Proof: For dim A — 1, the problem is trivial since we can simply use Lemma IA.1I Assume then 
that dim A > 1, and define the subspaces A' = sparjfjtti) . . . |u n _i)} and B' = span{|t>2) • • ■ |w m )}. 
By Lemma lA.ll we can approximate a transformation T that takes \u n ) to \v{). Consider now all the 
operators of the form W = f^UV'f with U € SU(A) and V € SU(B'). Clearly, W takes \u n ) to 
itself - and therefore leaves invariant the subspace A 1 © B. We claim that there is at least one such 
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transformation, that also mixes the subspaces A' and B. If this is indeed the case, then we can 

repeat the argument for the subspaces A' and B, which together with the particular transformation 
satisfy the conditions of Lemma lA.ll Consequently, we find a transformation W^ 2 ' that takes 
to \v\) while leaving \u n ) unchanged and mixing the space that is spanned by |i<i), . . . , |u„_2) with B. 
Repeating this again and again, we are left in the end with a transformation W' n ' that transforms 
to |i>i) and is the identity over |it2), ■ ■ ■ , \u n ). Since this recursion has only n steps, it is clear that at any 
step we can approximate the mixing transformation WW to any desired accuracy using a finite product 
of operators from SU(A), SU(B) and W. 

Let us now see why W^- 1 ' must exist. Indeed, if no such transformation exists, then for every two 
operators U £ SU (A) and V £ SU (B') there is no mixing between the subspaces A' and B, and therefore 
there must exist operators V £ SU(A') and V £ SU(B) such that 

T^UV'f = U'V . (43) 

Then, for every \u) £ A and \v) £ B, 

(ulf^UV'fv) = (u\U'Vv) = , (44) 

which implies that for every U £ SU(A) and V £ SU(B'), 

(uTU\V'fv) = . (45) 

Notice that this equation holds also when we take one of the operators, U or V', to be the identity. We 
will use this to show that TA = A - in contradiction with the fact that T\ui) = 

We first deduce that TA' C A. To do this we show that for all \u) £ A', T\u) has no projection on B. 
We already know T\u) has zero projection on \v\) (since \u\) moves to so it suffices to show that 

T\u) has no projection on B' . 

Indeed, by Equation 1451 it would suffice to show that V'Tv) can be made to be an arbitrary vector 
in B' . This follows from the following reasoning. By the same argument as in the proof of Lemma |A. II 
there must be a vector \v*) £ B such that T\v*) £ B. Moreover, T\v*) must be in B' since 

{v±\fv*) = (vxf-^v*) = {ux\v*) = . (46) 

Since f\v*) £ B' , using an arbitrary V £ SU(B'), V'f\v*) can be made to be any vector in B'. 

Now pick any |tt*) £ A' . Then T\u*) £ A, and therefore with an arbitrary transformation U £ SU(A), 
UT\u*) can be made to be an arbitrary vector in A. But since for every |u) £ B we have (u*TU\Tv) = 
then T\v) has no projection on A and consequently, TB = B. But then since T is unitary it follows that 
TA = A - which is the contradiction we were seeking. □ 
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Having proved the last two lemmas, We are now in a position to prove Lemma |4. II 
Proof: Let ■ ■ • , \u n )} be an orthonormal basis of A and similarly {|i>i), . . . , \v n )} an orthonormal 

basis of B. We define the following sequence of subspaces 



B t = Bi-i ® \ui) 



(47) 



for i = 1 • • • n, where Bq = B. 

Let us show how to approximate an arbitrary U G SU(Bi), given SU(B), SU(A), and W. From 
Lemma lA.21 we can use SU(B), SU(A) and W to approximate a transformation T that takes to 
|t>i) while leaving the rest of the vectors in A intact. Therefore T G SU(B\). Now pick an eigenvector 
\t/j) G B-y of U with an eigenvalue e 10 . Using Lemma fA. II with respect to the subspaces span \u\),B and 
the mixing transformation T, we can approximate a transformation G SU{B\) that takes |^>) to 
We first show how to approximate the transformation U\ = W^UW^ 1 . 

We notice that U% has \u{) as an eigenvector with an eigenvalue e lB . Consequently, U\ leaves the 
subspace B invariant. Let V\ be the transformation in SU(B) that satisfies Vi|i>i) — e l6 \vi), Vi\v^) — 
e~ l9 \v2), and leaves the rest of the basis vectors unchanged. Recalling that T takes \u%) to \vi), we see 
that T _1 ViT has |iti) as an eigenvector with eigenvalue e and leaves the subspace B invariant. So the 
only difference between Ui and T~ 1 ViT is some transformation V2 G SU(B), and therefore 



Now that we have generated all transformations in SU(Bi), we can generate SU(B2) using the very 
same procedure - except now B\ plays the role of B and \u,2) plays the role of \u\). In the same method 
we can work our way all up to SU(B n ) = SU(C). □ 

A. 3 Proof of Lemma 14.21 (The Decoupling Lemma) 

Proof: We define two subgroups H a < SU(A) and H b < SU(B) by 



Ui = V 2 T- l VxT , 



(48) 



and consequently, 



U - Wl^T-WxTW^ . 



(49) 




(50) 



(51) 



The theorem will be proved once we show that H a = SU(A) and = SU(B). 
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To do that, we first observe that both H a and H b arc normal subgroups. It is also straightforward 
to see that they are closed. Consider, for example, H a in SU(A): assume that {Uk} in H a converges to 
U € SU (A) . Then there exist series cr„ such that 

lim t q (ctW) = U k , (52) 

n— »oo 

lim n(a { n k) ) = 1 ■ (53) 

n— >oc 

Without loss of generality, we may choose the series such that for every n, /c, 

H-7-6 (o*) - t/fc|| < l/n , 1 1 7i (a*) - l|| < l/n . (54) 

Then the series T a (cr^) — ► U, and we are guaranteed that Tfc(cr£) — * i. 

Now, any non-trivial normal subgroup of SU(N) must be finite 5 . Therefore if we show that H a and 
H b are infinite it will follow that H a — SU(A) and H b = SU(B). To do that, we first show that there is 
an isomorphism of groups M (a 1 — 1 and onto mapping which preserves the action of the group) between 
the coset groups SU{A)/H a and SU{B)/H b . We define M as follows: M(UH a ) = VH b if there exists a 
series {<r n } such that 

r a {a n ) -> U , n(a n ) -> V . (55) 
We first need to show that this function is well defined for all cosets UH a . This follows from: 

• For each coset U H a , there exists at least one series that satisfies the requirements of the definition 
of M. Indeed, pick a series {cr„} such that T a (c n ) — > U. Then the series Tb(a n ) in SU{B) must 
have a limiting point since SU(B) is compact. Therefore there exists a sub-series {&„ k } such that 
t„KJ -> [/ and r b (a„ fc ) V. We have M(C/F a ) = VH b 

• M(UH a ) is defined uniquely. Indeed, assume there exist two series {<J^} and {<x« 2 } such that 

r a (aW)^C/x, ^(o-W) y x , (56) 
r Q (a( 2 ))^L/ 2 , r b (ai 2 ))->T/ 2 , (57) 

with E/i and C/2 in the coset UH a . We will show that V\ and V2 must be in the same coset of H b . 
Denote A = UxU^ . Since H a is normal, A is in H a and we may therefore find a series {c^ 3 ' 1 } 
such that 

r_M? )^U 1 U^ 1 , r b (ai 3 > ) -> 1 . (58) 

5 This follows from the fact that the quotient group SU (N) / Z{SU (N)) is a simple group, and Z(SU{N)) - the center of 
SU(N) - is finite. See for example Theorem 11.26, at page 108 of Ref P3 
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Then looking at the series cr^ =' (^rP) Vi 3 Vi 2 \ we find that 

ra(4 4) ) ^ U^U X U^U 2 = t , r h (a^) -+ V^V 2 • (59) 
Therefore Vf V 2 G if 6 , and so V x R b = V 2 H b . 

It remains to show that M is 1 — 1, onto, and preserves the action of the group. The onto part follows 
if we start with a series that converges to some V in VH b and apply the same reasoning as in the first 
item above. The 1 — 1 part follows if we apply the same reasoning as in the second point above, starting 
with the Vi, V 2 in the same coset instead of the opposite direction. The fact that M is a homomorphism 
follows from the fact that r is a representation. 

Recall now that H a and H b can be either finite groups or equal to their "supergroup" . So there are 
four possibilities: 

1. H a is finite and H b = SU{B). 

2. H b is finite and H a = SU(A). 

3. Both H a and H b are finite. 

4. H a = SU(A) and H b = SU(B). 

The first and second cases are impossible since, for example, if H a is finite and H b = SU(B) then 
SU(B)/H b has only one coset while SU{A)/H a has infinitely many - and thus they cannot be related by 
a 1 — 1 onto map. 

Let us now see why the third case is also impossible. To do this, we will show that M is a continuous 
map. Indeed, assume that UuH a — > UH a (here, convergence means that for some representatives of the 
cosets we have Uk — > U. It is easy to check that this is well defined). Then let M(UkH a ) = VkH b , 
M(UH a ) = VH b . We will show that VtH b — ► VH b . The proof is straightforward, similarly to the proof 
that H a is closed, pick pick a series of series {(Jn" 1 } such that 

lim T a (a^) = U k , (60) 

n— »oc 

lim n(a^) = V k , (61) 

n — >oo 

and without any loss of generality we assume that 

\\T a (o*)-U k \\<l/n , ||7i(o*)-Vfc||<l/n. (62) 

Then since U k — » U, we have r a (cr^) — > U, and since M(UH a ) = VH b we can find a sub-series T b (a k e ) 
that converges to some V G VH b . In order not to overload the notation, let us re-define k to be that 



26 



sub-series. We now claim V& — ► V. Indeed, for each e > 0, we can choose K such that for each k > K, 
l/k < e/2 and ||r 6 (cr£) - V\\ < e/2. Then 

\\V k - V\\ <\\V k - n «) || + ||r 6 «) - V-|| < e . (63) 

Now H a and iff, are closed normal subgroups, and therefore SU(A)/H a and SU(B)/H\, are Lie 
groups themselves (see for example, Theorem 3.64, pp 124, in |19|L Furthermore, since every continuous 
homomorphism between Lie groups is also smooth (see for example, Theorem 3.39, pp 109, in |19|h 
we have found a smooth diffcomorphism (1 — 1 homcomorphism) between two differentiable manifolds. 
However, since both H a and Hi, are finite then 

dim SU(A) /H a = dim SU(A) £ dim SU(B) = dim SU(B) /H b , (64) 

and it is therefore impossible to find a diffeomorphism between the two manifolds. □ 
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